Abstract. We study the Cauchy problem in a special case of non-linear massive
gravity: the two-tensor "f-g" theory. Although the theory is ghost-free, it has recently
been argued to be inherently problematic due to the existence of superluminal shock
waves. Furthermore, it is claimed that acausal characteristics can arise for any choice of
background. In order to further understand the causal structure of the theory, we
perform a detailed analysis of the characteristic equations and show that the theory
does admit a well-posed Cauchy problem, i.e., there exist hypersurfaces that are not
characteristic hypersurfaces. Puzzles remain regarding the existence of a superluminal
propagating mode in both the f-g theory and the full non-linear massive gravity. That
is, our result should not be taken as any indication of the healthiness of the theory. We
also give a detailed review of the Cauchy-Kovalevskaya theorem and its application in
the Appendix, which should be useful for investigating the causal structures of other
theories of gravity.

1. Introduction

The question as to whether the graviton can have a mass is an old one. A recent review
of the literature can be found in Ref. [1]. Here we give a very brief review of the main
progress and major difficulties encountered in constructing a healthy massive gravity 
theory. 

It is well-known that Fierz and Pauli constructed a theory of a free massive spin-2
particle back in 1939 [2], in which the quadratic form of the action can be uniquely
fixed as linearized general relativity (GR) with the Fierz-Pauli mass term, by imposing
the tachyon-free and ghost-free conditions [3]. However, it was discovered in 1970 that
this linear massive spin-2 theory suffers from the van Dam-Veltman-Zakharov (vDVZ)
discontinuity [4, 5], with a massless limit that does not recover that of linearized general
relativity. In particular, the predicted light-bending near massive objects is off by
25%, so the theory is completely ruled out by observations. The Vainshtein mechanism
was proposed in 1972, in which a non-linear effect is introduced to force the massless
limit to recover GR [6, 7]. However, in the same year it was shown that the very
non-linearity introduced gives rise to the Boulware-Deser ghost, or BD ghost for short [8].
The mass of the ghost mode is typically the same as that of the graviton, which
means that the ghost mode cannot be ignored (e.g., by pushing it to the Planck scale to be
dealt with by a quantum theory of gravity [9]), since its mass is expected to be small.
The BD ghost is essentially due to an extra degree of freedom reinstated by the
non-linearity, beyond the 5 degrees of freedom of the linearized theory. The interest in massive
gravity was revived recently due to the construction of a presumably consistent non-linear
massive gravity without the BD ghost [10, 11]. The absence of the BD ghost even at fully
non-linear orders was subsequently shown [12, 13, 14, 15]. It is interesting to note that, as
pointed out in [16], Wess and Zumino already constructed a theory without the 6th degree
of freedom, and thus free of the BD ghost, in 1970 [17]. We shall henceforth refer to such
presumably consistent theories as non-linear massive gravity in general. In non-linear
massive gravity, in addition to the dynamical metric $g_{\mu\nu}$, there is also a fixed background
with a non-dynamical metric $\bar{g}_{\mu\nu}$, also known as the fiducial metric.

Recently, the characteristic equations of a special case of non-linear massive gravity,
essentially the Wess-Zumino model with 5 degrees of freedom, were analyzed by Deser
and Waldron [16] (see also the newer preprint [18]). It was argued that the model
admits superluminal shock wave solutions and is therefore acausal, and in addition, that
the superluminal shocks are only absent in the special case where the contortion term
vanishes. Furthermore, ironically, this originates from the very constraint that removes
the BD ghost. In the same work, it is mentioned that this fatal problem arises for any
choice of background. This is an intriguing claim, which if true, is a rather strong one.
After all, we do know that at least some physics does depend on the choice of background.
For example, a FLRW (Friedmann-Lemaître-Robertson-Walker) background does not
admit the flat or closed cosmology‡ [19]; however, more general fiducial metrics do allow
FLRW universes of all signs of spatial curvature [20]. Motivated by the interesting
work of Ref. [16], we perform a careful analysis of the characteristic equations to further
understand the causal structure of non-linear massive gravity.

‡ This is also caused by the very constraint that removes the BD ghost.

In Section 2, we introduce the necessary set-up. Section 3 contains the main analysis
of the characteristic equations, while Section 4 gives a concrete example to show a
well-posed Cauchy problem in the theory. We conclude with a discussion on the existence
of superluminal propagation in non-linear massive gravity in Section 5. The method of
characteristics employed in our work is reviewed in detail in Appendix A.

2. Non-Linear Massive Gravity Theory 

As mentioned in the introduction, in non-linear massive gravity, we need to first choose
and fix a background with a non-dynamical reference metric $\bar{g}_{\mu\nu}$, in addition to the
physical metric $g_{\mu\nu}$. For convenience of our analysis, we will work in the tetrad formalism§
[21, 22, 23, 24, 25], following the work of Ref. [16]. We denote the background tetrad
by $f^a{}_\mu$, and the dynamical tetrad by $e^a{}_\mu$. Here Greek letters correspond to spacetime
indices, while Latin letters label tetrad vectors. The theory is also referred to as the
two-metric "f-g" theory [16].

The Levi-Civita connection $\omega(e)_\mu{}^{mn}$ that corresponds to the dynamical tetrad
is torsion-less. However, the same connection employed as the connection for the
background tetrad has nonzero torsion in general. The difference of the two connections
is the contortion tensor‖:

$$K_\mu{}^{mn} := \omega(e)_\mu{}^{mn} - \omega(f)_\mu{}^{mn}. \tag{1}$$

The background metric $\bar{g}_{\mu\nu}$ and the dynamical metric $g_{\mu\nu}$ are related to the background
tetrad and the dynamical tetrad, respectively, by

$$f^a{}_\mu f^b{}_\nu \eta_{ab} = \bar{g}_{\mu\nu} \quad\text{and}\quad e^a{}_\mu e^b{}_\nu \eta_{ab} = g_{\mu\nu}. \tag{2}$$

Raising and lowering of the Greek indices are performed via the dynamical metric, i.e.,

$$V^\mu = V_\nu g^{\nu\mu}, \quad\text{and}\quad V_\mu = V^\nu g_{\nu\mu}. \tag{3}$$

The action of the non-linear massive gravity theory is defined as

$$S = \frac{1}{2}\int d^4x\, e \left[ R + 2\sum_{n=0}^{4} a_n \mathcal{L}_n \right], \tag{4}$$

in the unit $c = 1$, where the $a_n$'s are constant parameters of the theory and the exact
expressions of the $\mathcal{L}_n$'s are

$$\mathcal{L}_0 = 1, \tag{5}$$

$$\mathcal{L}_1 = f, \tag{6}$$

$$\mathcal{L}_2 = f^2 - f_\mu{}^\nu f_\nu{}^\mu, \tag{7}$$

$$\mathcal{L}_3 = f^3 - 3 f f_\mu{}^\nu f_\nu{}^\mu + 2 f_\mu{}^\nu f_\nu{}^\lambda f_\lambda{}^\mu, \tag{8}$$

$$\mathcal{L}_4 = f^4 - 6 f^2 f_\mu{}^\nu f_\nu{}^\mu + 8 f f_\mu{}^\nu f_\nu{}^\lambda f_\lambda{}^\mu + 3 \left(f_\mu{}^\nu f_\nu{}^\mu\right)^2 - 6 f_\mu{}^\nu f_\nu{}^\lambda f_\lambda{}^\rho f_\rho{}^\mu. \tag{9}$$

§ The massive gravity theory in the tetrad formalism is, at least in classical physics, the same as the
non-linear massive gravity proposed in Refs. [12, 13, 14, 15].
‖ We would like to remark that the proper term is contortion instead of "contorsion". See e.g., [26].

Here, $f_{\mu\nu} = e^a{}_\mu f^b{}_\nu \eta_{ab}$ and $f = f^\mu{}_\mu$.

For simplicity, we consider the case where $a_2 = a_3 = a_4 = 0$. The equation of
motion can be obtained by the variation with respect to the dynamical tetrad $e^a{}_\mu$:

$$\mathcal{G}_{\mu\nu} := G_{\mu\nu} - a_0 g_{\mu\nu} + a_1 \left(f_{\nu\mu} - f g_{\mu\nu}\right) = 0, \tag{10}$$

where $G_{\mu\nu}$ is the Einstein tensor. Here, in order to have a simpler form of the equation of
motion, we have acted upon the expression obtained after taking the variation with respect
to $e^a{}_\alpha$ by $-e^a{}_\mu g_{\alpha\nu}$. We emphasize that $f_{\mu\nu}$ has all of the information of the dynamical
tetrad $e^a{}_\mu$, and thus we analyze the dynamics of $f_{\mu\nu}$ instead of that of $e^a{}_\mu$. From the
antisymmetric components of Eq. (10), we find that $f_{\mu\nu}$ is a symmetric tensor and thus
has ten degrees of freedom.

Equation (10) includes not only dynamical equations but also five constraint
equations, which we shall now derive. The first two terms in Eq. (10) are divergence
free, and thus, taking the divergence of Eq. (10) gives four constraint equations:

$$C_\nu := \nabla^\mu f_{\mu\nu} - \nabla_\nu f = 0. \tag{11}$$

Another constraint equation stems from the nontrivial combination of the trace of
Eq. (10) and Eq. (11); it is

$$C := 2\nabla_\mu\left(l^{\mu\nu} C_\nu\right) - g^{\mu\nu}\mathcal{G}_{\mu\nu} = 0, \tag{12}$$

where $l^{\mu\nu} = l^\mu{}_a e^{a\nu}$ and $l^\mu{}_a$ is the inverse of the background tetrad $f^a{}_\mu$. From the symmetry
of the indices of $f_{\mu\nu}$, we can infer that the indices of $l^{\mu\nu}$ are also symmetric. In the
combination (12), the terms involving second order derivatives of $f_{\mu\nu}$ cancel out,
and it becomes a constraint equation. To sum up, the ten symmetric components of
Eq. (10) include the five constraint equations given by Eq. (11) and Eq. (12), and five
dynamical components containing the second order derivatives of $f_{\mu\nu}$.

3. Characteristic in Non-Linear Massive Gravity 

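Suppose we have a hypersurface $\Sigma$ and $\xi^\mu$ is the unit normal vector to $\Sigma$. Here, we
consider the case where $\xi^\mu$ is timelike. We use Gaussian normal coordinates near the
hypersurface $\Sigma$,

$$ds^2 = -(dx^0)^2 + h_{ij}\, dx^i dx^j, \quad\text{with}\quad \left(\frac{\partial}{\partial x^0}\right)^\mu = \xi^\mu. \tag{13}$$

In Eq. (10), all second derivatives of $f_{\mu\nu}$ are of the form $\partial_\mu \partial_{[\nu} f_{\alpha]\beta}$, where the bracket $[\cdots]$
represents antisymmetrization, i.e.,

$$T_{\alpha\cdots[\beta\gamma]\cdots} := \frac{1}{2}\left(T_{\alpha\cdots\beta\gamma\cdots} - T_{\alpha\cdots\gamma\beta\cdots}\right). \tag{14}$$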

Having in mind the Cauchy-Kovalevskaya theorem [27, 28] (see Appendix A for a
detailed discussion), we shall introduce new variables

$$M_{\alpha\beta\gamma} := \partial_{[\alpha} f_{\beta]\gamma}. \tag{15}$$

Since $M_{\alpha\beta\gamma}$ is antisymmetric with respect to $\alpha$ and $\beta$, we naively expect that $M_{\alpha\beta\gamma}$ has
24 degrees of freedom. There are, however, 4 constraints for $M_{\alpha\beta\gamma}$, namely,

$$\epsilon^{\mu\alpha\beta\gamma} M_{\alpha\beta\gamma} = 0, \tag{16}$$

where $\epsilon^{\mu\alpha\beta\gamma}$ is the Levi-Civita symbol. This fact can be deduced from the definition (15)
of $M_{\alpha\beta\gamma}$. Therefore the actual number of degrees of freedom of $M_{\alpha\beta\gamma}$ is only 20. Under
3+1 decomposition of spacetime based on the metric (13), all non-vanishing components
can be written as

$$M_{0i0} = -M_{i00} = \frac{1}{2}\left(\partial_0 f_{i0} - \partial_i f_{00}\right), \tag{17}$$

$$M_{0ij} = -M_{i0j} = \frac{1}{2}\left(\partial_0 f_{ij} - \partial_i f_{0j}\right), \tag{18}$$

$$M_{ij0} = -M_{ji0} = \frac{1}{2}\left(\partial_i f_{j0} - \partial_j f_{i0}\right), \tag{19}$$

$$M_{ijk} = -M_{jik} = \frac{1}{2}\left(\partial_i f_{jk} - \partial_j f_{ik}\right). \tag{20}$$

Using the four constraints (16) for $M_{\alpha\beta\gamma}$, we can express three of the $M_{0ij}$ and one of the $M_{ijk}$ by
the other $M_{\alpha\beta\gamma}$'s. Thus, after careful comparison it can be deduced that $M_{0i0}$, $M_{0ij}$, $M_{ij0}$
and $M_{ijk}$ have three, six, three and eight degrees of freedom, respectively. In addition
to the four constraints (16), we can derive other relations among the $M_{\alpha\beta\gamma}$'s. They are:

$$\partial_0 M_{ij0} = \partial_i M_{0j0} - \partial_j M_{0i0}, \tag{21}$$

$$\partial_0 M_{ijk} = \partial_i M_{0jk} - \partial_j M_{0ik}. \tag{22}$$

Equations (17), (18), (21) and (22) can be regarded as the time-evolution equations
for $f_{i0}$, $f_{ij}$, $M_{ij0}$ and $M_{ijk}$, respectively. All coefficients of the terms with an $x^0$-derivative
in these equations are obviously nonzero. Therefore, the hypersurface $\Sigma$ is not a
characteristic surface for $f_{i0}$, $f_{ij}$, $M_{ij0}$ and $M_{ijk}$.
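The component count quoted above (24 antisymmetric components, 4 conditions from Eq. (16), 20 independent components) can be cross-checked mechanically. The following is a minimal sketch in Python using only the standard library; it is not part of the original analysis, and the helper names `perm_sign`, `rank` and `count_components` are illustrative. It treats the components $M_{\alpha\beta\gamma}$ with $\alpha < \beta$ as unknowns, writes the four conditions of Eq. (16) as linear equations, and computes their rank exactly.

```python
from fractions import Fraction
from itertools import permutations


def perm_sign(p):
    """Sign of a permutation of (0, 1, ..., n-1), via cycle sorting."""
    p = list(p)
    sign = 1
    for i in range(len(p)):
        while p[i] != i:
            j = p[i]
            p[i], p[j] = p[j], p[i]
            sign = -sign
    return sign


def rank(rows):
    """Rank of a matrix by exact Gaussian elimination."""
    m = [[Fraction(x) for x in row] for row in rows]
    r = 0
    for col in range(len(m[0])):
        pivot = next((i for i in range(r, len(m)) if m[i][col] != 0), None)
        if pivot is None:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(r + 1, len(m)):
            factor = m[i][col] / m[r][col]
            m[i] = [a - factor * b for a, b in zip(m[i], m[r])]
        r += 1
        if r == len(m):
            break
    return r


def count_components(dim=4):
    # Unknowns: M_{abc} with a < b (antisymmetry in the first index pair).
    pairs = [(a, b) for a in range(dim) for b in range(a + 1, dim)]
    triples = [(a, b, c) for (a, b) in pairs for c in range(dim)]
    index = {t: k for k, t in enumerate(triples)}
    naive = len(index)

    # One row per free index mu of eps^{mu a b c} M_{abc} = 0, Eq. (16).
    rows = []
    for mu in range(dim):
        row = [0] * naive
        for p in permutations(range(dim)):
            if p[0] != mu:
                continue
            _, a, b, c = p
            s = perm_sign(p)
            if a < b:
                row[index[(a, b, c)]] += s
            else:
                row[index[(b, a, c)]] -= s  # uses M_{abc} = -M_{bac}
        rows.append(row)

    constraints = rank(rows)
    return naive, constraints, naive - constraints


print(count_components())
```

The three numbers returned are the naive count, the number of independent conditions, and their difference, to be compared with 24, 4 and 20 in the text.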

Now, in order for the time evolution to be well-defined and solvable, we need to
find appropriate equations to fix the time derivatives of the ten variables: $f_{00}$, $M_{0i0}$ and
$M_{0ij}$. We have the ten components of the equation of motion (10), and so, should expect
that the time derivatives of these ten variables can be obtained from them.

Having introduced $M_{\alpha\beta\gamma}$, the equation of motion (10) can be written explicitly up
to first order derivatives of $f_{\mu\nu}$ and $M_{\alpha\beta\gamma}$ as

2 (g a n\/ v) + /^(T } - <7 Q ^V) d a M M5 + F, v 
+ l\ (dsf\) (l X0 S%6 5 v) - l xs 5\/ y) + g a5 l\/ y) 

- 9 ap i\/u) - g a5 i xp g,u + g aP i xs g,u)d a f M 

+ Ns^g^l^ - 2lH x5 5\5\ ) + 2l*lH\ ll 8 a lj) + 2g 5 ^l\^ u) 
- {2g a ^ 5 l x * + g Sa l x ^ + g 5 n^l Xa } 9llv )d a f M = 0, (23) 



where

$$N_{\delta\epsilon\lambda} := M_{\delta\epsilon\lambda} + e_{a[\delta}\partial_{\epsilon]} f^a{}_\lambda, \tag{24}$$

and the brackets $(\cdots)$ denote symmetrization, i.e.,

$$T_{\alpha\cdots(\beta\gamma)\cdots} := \frac{1}{2}\left(T_{\alpha\cdots\beta\gamma\cdots} + T_{\alpha\cdots\gamma\beta\cdots}\right), \tag{25}$$

and $F_{\mu\nu}$ is a function of $f_{\alpha\beta}$, $M_{\mu\nu\lambda}$, $f^a{}_\alpha$, $\partial_\mu f^a{}_\alpha$ and $\partial_\mu\partial_\nu f^a{}_\alpha$. Note that derivatives of
the dynamical variables $f_{\alpha\beta}$ and $M_{\mu\nu\lambda}$ do not appear in $F_{\mu\nu}$ and that derivatives of $f^a{}_\alpha$
are not dynamical but fixed variables because $f^a{}_\alpha$ is the given background tetrad. The
constraint equations (11) and (12) can be written without derivatives of the dynamical
variables $f_{\alpha\beta}$ and $M_{\mu\nu\lambda}$ as

l\C v = 2l^N^ + 2l^e a a d [a r m = 0, (26) 

B:=C-C^C V 

= 4a + 3ai/ - g^{2f x P v + g Pu g^)N, u xN aM 

+ efe^RU)^ - 2N v/n (2P^e bp ^ b + e^l^xpb) 

+ (2n ab l^ a + g va f\e bX l^) JF^JF^ (27) 

with

$$F_{\alpha\beta}{}^{c} := \partial_{[\alpha} f^c{}_{\beta]}, \tag{28}$$

where, in accord with Ref. [16], we use the combination (27) of the constraint equations
instead of $C$. Here $R(f)_{\mu\nu}{}^{ab}$ is the Riemann tensor corresponding to the background
vierbein $f^a{}_\mu$. Acted upon by the differential operator $\partial_0$, the two constraint equations
(26) and (27) now become time evolution equations.

Now, we have the equations (23), $\partial_0\left(l^{\mu\nu}C_\nu\right)$ and $\partial_0 B$, which are expected to describe
the time evolution of $f_{00}$, $M_{0i0}$ and $M_{0ij}$. These equations are quasi-linear¶ in the first
order derivatives of $f_{\mu\nu}$ and $M_{\mu\nu\lambda}$. Using the time-evolution equations for $f_{0i}$, $f_{ij}$, $M_{ij0}$
and $M_{ijk}$, i.e., Eqs. (17), (18), (21) and (22), we can eliminate their time derivatives from
Eq. (23), $\partial_0\left(l^{\mu\nu}C_\nu\right)$ and $\partial_0 B$, and consequently only the time derivatives of $f_{00}$, $M_{0i0}$ and
$M_{0ij}$ appear in these equations. The forms of the equations become

E, u := 2 (Z 00 (VV } - 9 m l\^ v) - l m 8\^ v) + g 00 l°*g» u ) d M 0l0 

+ [2N aPl (-l^5\/ u) +g°r*S^ll ) +g^m\/ u) 
- r<l Q WX - g 0a l°P^g^ + g 00 l a n 0l3 g^ 



+ l\ (dsfx) (l X °5V u) - l^6\ + 9° 5 l\A) 

dofoo 



,00; A e<5 „0<5;A0„ , „00;A<5, 



= F;f 9i/^ + FJf^diM^ + F^, (29) 

E, := 2 (/°^° M - l 00 S\) d M 0l0 + 2 (V 8\ - W J a M oii 

¶ Quasilinear means that the coefficients of the highest-order derivatives depend only on derivatives of
strictly lower order.
+ 2[-l^l 0S N^ + 5°/ x l 5x e a W b r 5]

- l^e a °l\d b r s] + /V^AlA^o/oo 
= Gf^difafi + G^diMafr + G„ (30) 

E := 4 (g ia l ^ - g 0a l°^r - <7 W 7 ) N a ^d M Qi0 

+ 4 (g ia P^ - g 0a l^r - g 0a g 1 ^) N a ^d M 0lJ + ~d f 00 
= H^&U + H^dtM^ + H, (31) 

(32) 



with 



-3a ig 00 - 2P°e a W' 

+ 4N^ x N a ^ {g°H 0a l v n xp + g pa P v l x H^ + g pa g 0u l^g^ x ) 

- 2 ( g ^f x r + ^W 7 - 9^l 0X P u ) N^dpf^ 

- 4/° Q 9 p / a 7 (rn^e bp + e^i^i" - rn^e m \T^ h 



<up~i 



( 2 pY^ e b P + 2P p l IB0 l 0u e bp 



2N viyy 2FTT V + 






+ P p l^e bo l op + 2l m P v l pp e in \T Vifih 
2r] ab P°P°P a + 2rj ab P u P°l 0a 

+ g™e a0 e w l^ + g ua e aX f\l^P°)F^ a F apb , (33) 



where $F^{i\alpha\beta}_{\mu\nu}$, $F^{i\alpha\beta\gamma}_{\mu\nu}$, $F_{\mu\nu}$, $G^{i\alpha\beta}_{\mu}$, $G^{i\alpha\beta\gamma}_{\mu}$, $G_\mu$, $H^{i\alpha\beta}$, $H^{i\alpha\beta\gamma}$ and $H$ are functions of $f_{\alpha\beta}$, $M_{\mu\nu\lambda}$,
$f^a{}_\alpha$, $\partial_\mu f^a{}_\alpha$ and $\partial_\mu\partial_\nu f^a{}_\alpha$, and contain no derivative terms of $f_{\alpha\beta}$ and $M_{\mu\nu\lambda}$. Although
Eqs. (29), (30) and (31) seem to have fifteen components in total, only ten of the fifteen
are actually independent time-evolution equations. This can be seen as follows: For
$\mu = 0$ in Eq. (29), the left hand side becomes zero, and thus it does not give any time
evolution. Moreover, the trace of the left hand side of Eq. (29) is proportional to the
left hand side of Eq. (30) with $\mu = 0$, which means that the trace does not give any
independent information for the time evolution. This is consistent with the discussion
in the previous section, i.e., five of the ten degrees of freedom of Eq. (10) should become
constraints as in Eq. (11) and Eq. (12).

To conclude, we now have ten time-evolution equations. We can finally write down 
the characteristic equations. They are: 

is/oo + J2 (g ia i op i 01 - 9 0a i°^ h - s° W 7 ) N aM M m 

i 
i 

+ J2 2 (g ia l j ^ - g 0a i jP r - <?° W 7 ) N a ^M 0(ij) = o, (34) 

i>j 
l m M 0l0 + l 0l M 0ii + J2^M 0{ij ) 

+ (>/ 0/3 AW - /° 7 eXd [7 A + l fJ Q5 ^f a s) /oo = 0, (35) 

£ (*V*) - i ok g* 3 ) M oko + J2 P V*) - /fcfc ^) ^ 

+ [i\W (-^V*) - g 0(1 i 0u i 0X g iJ - i (lX i 0u g lJ ) 

+ ^°6 fa A) ( Vi) - s°^V; - ^;) ] /oo = o, (36) 

where we explicitly show the summation with respect to spatial coordinates because
some of the indices do not obey Einstein's summation rule. Here, we decompose $M_{0ij}$
into $M_{0(ij)}$ and $M_{0[ij]}$, but $M_{0[ij]}$ does not appear in the characteristic equations because
from Eq. (16) we see that $M_{0[ij]}$ can be expressed in terms of $M_{ij0}$. Here, the notation
$\tilde{M}_{\mu\nu\lambda}$ and $\tilde{f}_{00}$ means they are not the values of $M_{\mu\nu\lambda}$ and $f_{00}$, but represent the change
of the fields in the $x^0$-direction.

4. Characteristic Equations with Non- Vanishing Contortion 



In Ref. [16], it was argued that, if the contortion does not vanish, an arbitrary
hypersurface can be a characteristic hypersurface. If so, a generic solution always has
an acausal structure, and thus the theory is very problematic. With our characteristic
equations, we revisit this problem.

We first consider the simplest example with the Minkowski background tetrad.
Suppose the dynamical tetrad completely coincides with the background tetrad, i.e.,
$f^a{}_\mu = e^a{}_\mu$, and let us examine the characteristic determinant corresponding to a flat
spatial hypersurface. Under this situation, the expressions of $\bar{g}_{\mu\nu}$, $g_{\mu\nu}$, $f_{\mu\nu}$, $l_{\mu\nu}$, $M_{\mu\nu\lambda}$
and $N_{\mu\nu\lambda}$ become

$$\bar{g}_{\mu\nu} = g_{\mu\nu} = f_{\mu\nu} = l_{\mu\nu} = \eta_{\mu\nu}, \quad\text{and}\quad M_{\mu\nu\lambda} = N_{\mu\nu\lambda} = 0. \tag{37}$$

Then the characteristic equations simplify significantly and reduce to 

- Mon - M 022 = 0, 2M 0(12) = 0, -Mqio = 0, 

- M 022 - M 033 = 0, 2M 0(23) = 0, -M 020 = 0, 

- M 033 - Mon = 0, 2M (3i) = 0, -Moso = 0, --aj 00 = 0. (38) 

The characteristic determinant of this system is evidently nonzero in general, and its
eigenvalues are of order $O(1)$ and $O(a_1)$, which are finite constants.

Now we consider a tiny perturbation on Minkowski space and consider the changes
in the characteristic determinant. Since the characteristic determinant on Minkowski
space is finite and nonzero, by continuity, with a sufficiently small perturbation the
characteristic determinant must still be nonzero even if the contortion does not vanish.
This means that the hypersurface is not a characteristic hypersurface (although for some
values of the fields, the characteristic determinant could become zero and thus a superluminal
mode could potentially arise; we comment on this further in Section 5). This is in
conflict with the conclusion of Ref. [16]. There is thus a need to point out why our
analysis gives a result that is different from that of Ref. [16].
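The Minkowski computation and the continuity argument can be illustrated numerically. The sketch below is only an illustration under stated assumptions, not a reproduction of the full characteristic matrix. It orders the ten unknowns as (M_011, M_022, M_033, M_0(12), M_0(23), M_0(31), M_010, M_020, M_030, f_00), fills in the entries following the pattern of Eq. (38), and treats the numerical prefactor of the $a_1 f_{00}$ entry as a hypothetical parameter `k` (set to -1 here), since the argument only needs it to be a nonzero multiple of $a_1$. The perturbation, which adds the same small number to every entry, is made up for illustration and is not derived from Eqs. (34)-(36).

```python
from fractions import Fraction


def determinant(rows):
    """Exact determinant by Gaussian elimination with row swaps."""
    m = [[Fraction(x) for x in row] for row in rows]
    n = len(m)
    det = Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if m[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            det = -det
        det *= m[col][col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n):
                m[r][c] -= factor * m[col][c]
    return det


def minkowski_matrix(a1, k=-1):
    """10x10 matrix patterned on Eq. (38).

    k is a hypothetical stand-in for the numerical prefactor of the
    a1 * f_00 entry; only k != 0 matters for the argument.
    """
    m = [[0] * 10 for _ in range(10)]
    # Rows 0-2: -M011 - M022, -M022 - M033, -M033 - M011.
    for row, (p, q) in enumerate([(0, 1), (1, 2), (2, 0)]):
        m[row][p] = -1
        m[row][q] = -1
    # Rows 3-5: 2 M0(12), 2 M0(23), 2 M0(31).
    for i in range(3, 6):
        m[i][i] = 2
    # Rows 6-8: -M010, -M020, -M030.
    for i in range(6, 9):
        m[i][i] = -1
    # Row 9: k * a1 * f_00.
    m[9][9] = k * a1
    return m


def perturbed(matrix, eps):
    """Add eps to every entry: an arbitrary, illustrative perturbation."""
    return [[Fraction(x) + eps for x in row] for row in matrix]


base = minkowski_matrix(a1=1)
print(determinant(base))
print(determinant(perturbed(base, Fraction(1, 1000))))
```

With these conventions the unperturbed determinant is 16 k a_1, so it vanishes only if $a_1 = 0$. For this particular made-up perturbation the determinant stays nonzero for small `eps` and first vanishes at `eps = 1/4`, which mirrors the remark that special, sufficiently large field values can make a given hypersurface characteristic.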

In our analysis, the first order derivative of $f_{00}$ enters the characteristic equations,
while in the analysis of Ref. [16] it does not. The reason why it does not appear in their
analysis is that the authors assume all discontinuities across a shock surface only show
up in the second order derivative of the dynamical metric. However, as we will show
in Appendix A, in the discussion of characteristics we should define the characteristic
matrix such that every dynamical field appears in the general solutions. Indeed, except
for the contribution from $\partial_0 f_{00}$, our result is otherwise consistent with that of Ref. [16].
However, since their characteristic matrix does not have the contribution from $\partial_0 f_{00}$,
its determinant always becomes zero. If we go back to the original viewpoint of the
Cauchy-Kovalevskaya theorem and the discussion about shock waves, it is reasonable
for $\partial_0 f_{00}$ to appear in the characteristic matrix. This is further discussed in Appendix A.

5. Discussion: Does Non-Linear Massive Gravity Admit Superluminal 
Propagation? 

We have carefully examined the characteristic equations of a special case of non-linear
massive gravity and showed that the Cauchy problem can be well-posed, in the sense that
not every hypersurface is a characteristic hypersurface for a given physical metric.
From the viewpoint of a fixed fiducial metric, of course, on an arbitrary hypersurface
there probably exist specific field values, i.e., specific physical metrics, which give
a vanishing characteristic determinant. In this sense, any hypersurface can potentially
be characteristic by tuning the values of the physical metric. However, the situation here
is a bit more subtle than the result in Ref. [16], in which the authors argued that any
hypersurface can be characteristic for a given physical metric. The difference is due to the
contribution of the first derivative term of $f_{00}$, which should appear in the characteristic
equation in view of the discussion in Appendix A.



We emphasize again that our result does not guarantee that the theory is free 
of problems. Indeed, we have merely shown that there exist good non-characteristic 
hypersurfaces, not the absence of superluminal characteristics. One should next examine 
the characteristic matrix carefully to determine if its determinant admits any real root,
the presence of which would imply the existence of superluminal propagation. The
existence of such a real root will directly affect the discussion of causality. It reduces
the Cauchy region and is sometimes responsible for giving rise to acausality. In order 
to have a deeper understanding of the causal structure of the theory, we will need an 
elaborate inspection of the characteristics. We shall leave this issue for future work. 

It is probably a good guess that non-linear massive gravity does suffer from
superluminal propagation in general and that this might result in acausality. Indeed, we
have seen tantalizing hints of superluminal propagation on the self-accelerating solution
[29], and in fact, instantaneous propagation may arise in the full theory with all the $a_n$'s
being nonzero [30]. In particular, energy can probably be emitted with infinite speed
on the self-accelerating background by the helicity-0 mode of the massive graviton.
In addition to the superluminal mode, non-linear massive gravity on the self-accelerating
solution has the property that although it generically has 5 degrees of freedom [12], in
the second order action on an open FLRW background, perturbative analysis only reveals
two tensor degrees of freedom [20]. That is, the full theory has a different number of
degrees of freedom compared to some of the low order limits. This could be dangerous
as the excitations of the extra degrees of freedom may be accompanied by anomalous
characteristics, as discussed in the context of f(T) gravity [31] and Poincaré gauge theory
[32]. Solutions with such anomalous characteristics are inappropriate to describe our
Universe. Furthermore, non-linear instability has also been pointed out in the
self-accelerating solution [33].

There remain a few puzzles to be resolved. Firstly, a further careful analysis
is required to check if non-linear massive gravity with $\{a_n\}_{n=2,3,4} = 0$ does admit
superluminal propagation as argued in Ref. [16], which might well be the case. Secondly,
it would be interesting to investigate how the superluminal helicity-0 mode arises in the
full theory and its corresponding characteristic analysis. Solving these puzzles, we can assess
the appropriateness of each solution, and it might give us further constraints on the
theory.

Appendix A. Review of Characteristic Analysis à la Cauchy and
Kovalevskaya

The analysis of characteristics is related to the Cauchy-Kovalevskaya theorem, from
which the propagation of a shock-wave front can be obtained. Our discussion here is based
on the equations that appear in the usual proof of the Cauchy-Kovalevskaya theorem
(see, e.g., the seminal work of Courant and Hilbert [34]). Since the naive extension of
the result in the case involving only a single field to that of a multi-field case is invalid,
we revisit the original arguments used in the proof of the Cauchy-Kovalevskaya theorem
and derive the exact characteristic equations in the multi-field case.

Appendix A.1. Single Field

Suppose we have an n-th order differential equation which is quasilinear for the n-th
order derivative terms, i.e.

$$A^{\alpha_1\cdots\alpha_n}\partial_{\alpha_1}\cdots\partial_{\alpha_n}\phi + A = 0, \tag{A.1}$$

where $\phi$ is a field and $A^{\alpha_1\cdots\alpha_n}$ and $A$ are functions of $x^\mu$ and derivatives of $\phi$ up to the
$(n-1)$-th order. We now introduce a hypersurface $\Sigma$ which is orthogonal to $\xi^\mu$. If

$$A^{\alpha_1\cdots\alpha_n}\xi_{\alpha_1}\cdots\xi_{\alpha_n} = 0, \tag{A.2}$$

then the hypersurface $\Sigma$ is called a characteristic hypersurface. On the other hand, if

$$A^{\alpha_1\cdots\alpha_n}\xi_{\alpha_1}\cdots\xi_{\alpha_n} \neq 0, \tag{A.3}$$

the hypersurface $\Sigma$ is called a non-characteristic hypersurface. We have the following
important result [27, 28]:

Cauchy-Kovalevskaya Theorem: If all the coefficients $A^{\alpha_1\cdots\alpha_n}$ and $A$ are analytic and
if the hypersurface $\Sigma$ is a non-characteristic hypersurface, then there exists a unique
local analytic solution $\phi$ in the neighbourhood of $\Sigma$.

This theorem implies that the uniqueness of time-evolution from a characteristic
hypersurface is not guaranteed. Moreover, there is an important fact about
characteristics, namely:

Remark: A characteristic hypersurface can be a shock-wave front.

This remark means that there could be non-linear propagation on the characteristic
hypersurface. It can be intuitively understood as follows: Since we have the n-th
order differential equation of motion for a single field $\phi$, we can introduce, on the initial
hypersurface $\Sigma_0$, $n$ initial conditions for $(\partial_t)^{n-1}\phi, \cdots, \partial_t\phi$ and $\phi$, where
$\xi^\mu := (\partial/\partial t)^\mu$. If the initial hypersurface $\Sigma_0$ is not characteristic, we can derive $(\partial_t)^n\phi$
from the equation of motion, and then the values of $(\partial_t)^{n-1}\phi, \cdots, \partial_t\phi$ and $\phi$ after a small
time evolution from the initial hypersurface $\Sigma_0$ can be uniquely fixed as

$$(\partial_t)^{k-1}\phi(t_0 + \Delta t) = (\partial_t)^{k-1}\phi(t_0) + (\partial_t)^{k}\phi(t_0)\,\Delta t, \quad (1 \le k \le n), \tag{A.4}$$

where $t = t_0$ on the initial hypersurface $\Sigma_0$. However, if the hypersurface $\Sigma_0$ is
characteristic, $(\partial_t)^n\phi$ cannot be fixed uniquely, which allows a discontinuity in $(\partial_t)^n\phi$
and thus non-unique time evolution of $(\partial_t)^{n-1}\phi$.
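The update rule (A.4) can be sketched as a single explicit step. The code below is an illustrative toy and is not taken from the references: the names `step`, `lead_coeff` and `rest` are hypothetical, the field is assumed to depend on $t$ only, and the equation is written as `lead_coeff(state) * (d_t)^n phi + rest(state) = 0`, mirroring the structure of Eq. (A.1). When the leading coefficient vanishes the step refuses to proceed, which corresponds to the characteristic case.

```python
def step(state, dt, lead_coeff, rest):
    """Advance [phi, d_t phi, ..., d_t^(n-1) phi] by one step of Eq. (A.4)."""
    a = lead_coeff(state)
    if a == 0:
        # Characteristic case: the n-th time derivative is not determined.
        raise ValueError("leading coefficient vanishes: characteristic surface")
    highest = -rest(state) / a        # d_t^n phi from the equation of motion
    derivs = list(state) + [highest]  # d_t^k phi for k = 0, ..., n
    # (A.4): d_t^(k-1) phi(t0 + dt) = d_t^(k-1) phi(t0) + d_t^k phi(t0) * dt
    return [derivs[k - 1] + derivs[k] * dt for k in range(1, len(derivs))]


# Toy example: phi'' + phi = 0 with phi(t0) = 1 and d_t phi(t0) = 0.
new_state = step([1.0, 0.0], 0.1, lead_coeff=lambda s: 1.0, rest=lambda s: s[0])
print(new_state)
```

Passing a `lead_coeff` that returns zero raises `ValueError`, reflecting that the highest derivative, and hence the evolution of $(\partial_t)^{n-1}\phi$, is not fixed on a characteristic hypersurface.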

In order to make the discussion in the next subsection clearer, we comment here
that, in the usual proof of the Cauchy-Kovalevskaya theorem [31], a quasilinear n-th order
differential equation for a single field $\phi$ is reduced to a system of quasilinear first order
differential equations for the multi-field $u$, where

$$u = \left(\phi,\ \frac{\partial\phi}{\partial x^0},\ \cdots,\ \frac{\partial\phi}{\partial x^d},\ \frac{\partial^2\phi}{\partial x^0\partial x^0},\ \cdots\right), \tag{A.5}$$

and $u$ includes all partial derivatives of $\phi$ up to the $(n-1)$-th order. Let $m$ denote the
number of components of $u$. Then, Eq. (A.1) can be exactly represented by

$$B^t[u, x]\,\partial_t u = B^i[u, x]\,\partial_i u + B[u, x], \tag{A.6}$$

where $B^t[u, x]$, $B^i[u, x]$ and $B[u, x]$ depend on $u$ and $x$. The first two are $m \times m$
matrices and the last one is an $m$-vector. Time evolution of every component in $u$ can be
obtained if $\det B^t \neq 0$. While most of the eigenvalues of $B^t[u, x]$ are always nonzero, the
only nontrivial eigenvalue is the same as $A^{t\cdots t}$ in Eq. (A.2). In the proof of the
Cauchy-Kovalevskaya theorem, after transforming into Eq. (A.6), the uniqueness is proved under
the condition $\det B^t \neq 0$ (of course we also need the analyticity condition).

To conclude, in the Cauchy-Kovalevskaya theorem and in the discussion of shock
waves, the only important term is $A^{t\cdots t}$. If $A^{t\cdots t} = 0$, it is outside the range
of applicability of the Cauchy-Kovalevskaya theorem to ensure a well-posed Cauchy
problem, and a shock wave can indeed propagate on the characteristic hypersurface.

Appendix A.2. Multi-Field

Here, we extend the discussion in the previous subsection to a multi-field case. While
this can sometimes be extended straightforwardly, it is, in general, quite nontrivial. We
will give examples of both cases in this subsection.

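Appendix A.2.1. The Simple Case We consider equations for two fields $\phi$ and $\psi$,

$$S^{\mu\nu}\partial_\mu\partial_\nu\phi + T^{\mu\nu}\partial_\mu\partial_\nu\psi + S = 0, \tag{A.7}$$

$$U^{\mu\nu}\partial_\mu\partial_\nu\phi + V^{\mu\nu}\partial_\mu\partial_\nu\psi + U = 0, \tag{A.8}$$

where $S^{\mu\nu}$, $T^{\mu\nu}$, $S$, $U^{\mu\nu}$, $V^{\mu\nu}$ and $U$ are all functions of $\partial_\alpha\phi$, $\phi$, $\partial_\beta\psi$, $\psi$ and $x$. Picking up
the higher order derivative terms with respect to $t$, we have

$$M \begin{pmatrix} \tilde{\phi} \\ \tilde{\psi} \end{pmatrix} = 0, \quad\text{with}\quad M := \begin{pmatrix} S^{tt} & T^{tt} \\ U^{tt} & V^{tt} \end{pmatrix}, \tag{A.9}$$

where $\tilde{\phi}$ and $\tilde{\psi}$ mean they are not the values of $\phi$ and $\psi$, but represent the change of the
fields in the $t$-direction. If $\det M = 0$, we cannot derive one of the linear combinations
of $\tilde{\phi}$ and $\tilde{\psi}$, and the hypersurface is a characteristic hypersurface, according to the
discussion in the previous subsection.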

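Appendix A.2.2. The Nontrivial Case Here, we consider the equations for two fields $\zeta$
and $\sigma$,

$$I^{\mu\nu}\partial_\mu\partial_\nu\zeta + J^\mu\partial_\mu\sigma + I = 0, \tag{A.10}$$

$$L = 0, \tag{A.11}$$

where $I^{\mu\nu}$, $J^\mu$, $I$ and $L$ are all functions of $\partial_\alpha\zeta$, $\zeta$, $\sigma$ and $x$. Actually, the equations in
the non-linear massive gravity theory are very similar to these equations. To obtain the
second order differential equation, we operate with $\partial_\mu$ on Eq. (A.11) and obtain

$$\partial_\mu L = \frac{\partial L}{\partial(\partial_\alpha\zeta)}\partial_\mu\partial_\alpha\zeta + \frac{\partial L}{\partial\zeta}\partial_\mu\zeta + \frac{\partial L}{\partial\sigma}\partial_\mu\sigma + \frac{\partial L}{\partial x^\mu} = 0. \tag{A.12}$$

If we naively pick up the higher-order derivative terms, the characteristic equations are

$$\bar{N} \begin{pmatrix} \tilde{\zeta} \\ \tilde{\sigma} \end{pmatrix} = 0, \quad\text{with}\quad \bar{N} := \begin{pmatrix} I^{tt} & 0 \\ \frac{\partial L}{\partial(\partial_t\zeta)} & 0 \end{pmatrix}, \tag{A.13}$$

where $\tilde{\zeta}$ and $\tilde{\sigma}$ mean they are not the values of $\zeta$ and $\sigma$, but represent the change of
the fields in the $t$-direction. Then, $\det\bar{N}$ is trivially zero. Does it mean that every
hypersurface is a characteristic hypersurface?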

In order to investigate this question, first we consider a simple example

$$L = P + Q\sigma, \tag{A.14}$$

where $P$ is a function of $\partial_\alpha\zeta$ and $\zeta$, while $Q$ is a non-zero constant. Then, Eq. (A.11)
can be regarded as a constraint equation for fixing the auxiliary field $\sigma$ and we can solve
it for $\sigma$, giving $\sigma = -P/Q$. Substituting the solution into Eq. (A.10), we can obtain an equation for $\zeta$:

$$\left(Q I^{\mu\nu} - J^\mu \frac{\partial P}{\partial(\partial_\nu\zeta)}\right)\partial_\mu\partial_\nu\zeta + \bar{I} = 0, \tag{A.15}$$

where $\bar{I}$ is a function of $\partial_\alpha\zeta$, $\zeta$ and $x$. Now, Eq. (A.15) is the equation for the single field
$\zeta$ and the characteristic equation for $\zeta$ is obviously

$$Q I^{tt} - J^t \frac{\partial P}{\partial(\partial_t\zeta)} = 0, \tag{A.16}$$

which is different from Eq. (A.13). From the discussion in the previous subsection, $\zeta$ is
uniquely fixed in the neighborhood if

$$Q I^{tt} - J^t \frac{\partial P}{\partial(\partial_t\zeta)} \neq 0, \tag{A.17}$$

and then, $\sigma$ can also be determined uniquely from Eqs. (A.11) and (A.14). Therefore
we find that Eq. (A.16) is the exact characteristic equation instead of Eq. (A.13).

How do we derive the characteristic equation in a general multi-field case? Going
back to Eqs. (A.10) and (A.11), it can be seen that in both equations, the order of the
derivative operators acting on $\sigma$ is one less than that acting on $\zeta$. This means that a discontinuity
of $(\partial_t)^2\zeta$ can be balanced with that of $\partial_t\sigma$. Therefore, not only the second order
derivative of $\zeta$ but also the first order derivative of $\sigma$ should appear in the characteristic
equation. Then, the characteristic equation can be written as

$$N \begin{pmatrix} \tilde{\zeta} \\ \tilde{\sigma} \end{pmatrix} = 0, \quad\text{with}\quad N := \begin{pmatrix} I^{tt} & J^t \\ \frac{\partial L}{\partial(\partial_t\zeta)} & \frac{\partial L}{\partial\sigma} \end{pmatrix}, \tag{A.18}$$

and the condition of the characteristic is consistent with Eq. (A.16).
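To make the contrast between Eqs. (A.13) and (A.18) concrete, here is a small numerical sketch. All sample values and function names are made up for illustration; the only inputs are the coefficients $I^{tt}$, $J^t$, $\partial L/\partial(\partial_t\zeta)$ and $\partial L/\partial\sigma$ evaluated at one point. For the example $L = P + Q\sigma$ of Eq. (A.14), $\partial L/\partial\sigma = Q$ and $\partial L/\partial(\partial_t\zeta) = \partial P/\partial(\partial_t\zeta)$.

```python
def det2(m):
    """Determinant of a 2x2 matrix given as nested lists."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]


def naive_matrix(I_tt, dL_dzeta_t):
    # Eq. (A.13): only second t-derivatives are kept, so the sigma column is zero.
    return [[I_tt, 0], [dL_dzeta_t, 0]]


def characteristic_matrix(I_tt, J_t, dL_dzeta_t, dL_dsigma):
    # Eq. (A.18): the first t-derivative of sigma enters alongside
    # the second t-derivative of zeta.
    return [[I_tt, J_t], [dL_dzeta_t, dL_dsigma]]


# Hypothetical sample values at one point of the hypersurface.
I_tt, J_t, dP_dzeta_t, Q = 2, 3, 5, 7

print(det2(naive_matrix(I_tt, dP_dzeta_t)))
print(det2(characteristic_matrix(I_tt, J_t, dP_dzeta_t, Q)))
```

The first determinant vanishes for any input, while the second equals $I^{tt}\,\partial L/\partial\sigma - J^t\,\partial L/\partial(\partial_t\zeta)$, which for $L = P + Q\sigma$ is the combination that controls the single-field equation obtained after eliminating $\sigma$.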

This statement can be justified further by writing the equations in the form of
Eq. (A.6). Let us introduce new variables

$$v_i := \partial_i\zeta, \tag{A.19}$$

$$u := \partial_t\zeta, \tag{A.20}$$

which are related by the commutativity of partial derivatives:

$$\partial_t v_i = \partial_i u. \tag{A.21}$$

Eqs. (A.10) and (A.12) can be represented by

$$I^{tt}\partial_t u + J^t\partial_t\sigma = -\left(I^{ti} + I^{it}\right)\partial_i u - I^{ij}\partial_i v_j - J^i\partial_i\sigma - I, \tag{A.22}$$

$$\frac{\partial L}{\partial(\partial_t\zeta)}\partial_t u + \frac{\partial L}{\partial\sigma}\partial_t\sigma = -\frac{\partial L}{\partial(\partial_i\zeta)}\partial_i u - \frac{\partial L}{\partial\zeta}\partial_t\zeta - \frac{\partial L}{\partial x^t}, \tag{A.23}$$

where all coefficients $I^{\mu\nu}$, $J^\mu$, $I$, $\partial L/\partial(\partial_\mu\zeta)$, $\partial L/\partial\zeta$, $\partial L/\partial\sigma$ and $\partial L/\partial x^t$ are written in
non-derivative variables $\zeta$, $\sigma$, $u$ and $v_i$. The combination of Eqs. (A.20)-(A.23) is exactly
of the form of Eq. (A.6). We can then apply the Cauchy-Kovalevskaya theorem and also
have the same discussion about shock waves. Again, we can deduce that $\det N = 0$ is
the condition for the hypersurface to be a characteristic hypersurface.

Finally, we discuss in brief the method for deriving characteristic equations with
general multi-fields. We start by choosing one of the fields and define the dimension of
it as zero. The dimension, denoted by $[\cdot]$, of the differential operator is one, i.e. $[\partial] = 1$.
We determine the dimensions of other fields such that each field necessarily appears in
one of the characteristic equations. In the case of Eqs. (A.10) and (A.11), for instance,
we define the dimension of $\zeta$ as zero. If the dimension of $\sigma$ is larger than one, only $\partial_\mu\sigma$
can appear in the characteristic equations. On the other hand, if the dimension of $\sigma$
is smaller than one, only $\partial_\mu\partial_\nu\zeta$ can appear in the characteristic equations. Therefore,
only in the case where the dimension of $\sigma$ is equal to one, both can appear in the
characteristic equations, in which case we have the situation discussed above.
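The dimension-counting rule can be phrased as a small bookkeeping routine. The sketch below is illustrative only: the term list encodes the highest-derivative terms of Eq. (A.10) as (label, field, number of derivatives), and `principal_terms` is a hypothetical helper that keeps the terms of maximal total dimension once $[\zeta] = 0$, $[\partial] = 1$ and a trial value of $[\sigma]$ are assigned.

```python
from fractions import Fraction


def principal_terms(terms, field_dim):
    """Return the labels of the terms with the largest total dimension.

    terms: list of (label, field, n_derivatives); each derivative counts 1.
    field_dim: mapping from field name to its assigned dimension.
    """
    dims = {label: field_dim[field] + n for label, field, n in terms}
    top = max(dims.values())
    return sorted(label for label, d in dims.items() if d == top)


# Highest-derivative terms of Eq. (A.10): I d d zeta and J d sigma.
eq_a10 = [("dd_zeta", "zeta", 2), ("d_sigma", "sigma", 1)]

for sigma_dim in (Fraction(1, 2), 1, 2):
    dims = {"zeta": 0, "sigma": sigma_dim}
    print(sigma_dim, principal_terms(eq_a10, dims))
```

Only the assignment $[\sigma] = 1$ keeps both the second derivative of $\zeta$ and the first derivative of $\sigma$ among the leading terms, which is the case in which every field appears in the characteristic equations.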